A theoretical basis to the automated detection of copying between texts, and its practical implementation in the Ferret plagiarism and collusion detector

نویسندگان

  • Caroline Lyon
  • Ruth Barrett
  • James Malcolm
چکیده

The theoretical background to the automated detection of plagiarism and collusion is investigated in this paper. We examine the underlying concepts, and see how features of language can be exploited to produce an effective system. Independently written texts have markedly different characteristics to those that include passages that have been fully or partially copied, and they can be effectively identified. The paper describes the implementation of the Ferret plagiarism and collusion detector, and its use in the University of Hertfordshire and other institutions. The difference between human and machine analysis is examined, and we conclude that an approach using machine processing is likely to be necessary in many situations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Are We Ready for Large Scale Use of Plagiarism Detection Tools?

One strategy in the prevention and detection of plagiarism and collusion is to use an automated detection tool. We argue that, for consistent treatment of students, we should be applying these tools to ALL written submissions in a given assignment rather than merely using a detection tool to confirm suspicions that a single text has been plagiarised. In this paper we describe our investigations...

متن کامل

Copy detection in Chinese documents using the Ferret: a report on experiments

The Ferret copy detector has been used for some years on English texts to find plagiarism in large collections of students’ coursework. This article reports on extending its application to Chinese, which differs from English in many respects: the sequence of characters that make up a Chinese text do not have word boundaries marked, there is a vast Chinese “alphabet”, or number of different char...

متن کامل

Copy detection in Chinese documents using the Ferret

The Ferret copy detector has been used for some years on English texts to find plagiarism in large collections of students’ coursework. This article reports on extending its application to Chinese. Corpora of coursework from two Chinese universities have been collected, and our experiments show that the Ferret can find both artificially constructed plagiarism and also actually occurring, previo...

متن کامل

Identifying and Classifying Students\' Academic Misconducts (Systematic Review)

Background: Planning for the decrease of scientific misconducts among students requires the recognition of subjects and relevant cases. The aim of the current study is determining the categories of scientific immorality among university students and categorizing them. Method: The current study is in the category of descriptive studies and data gathering is done using the systematic review meth...

متن کامل

Plagiarism checker for Persian (PCP) texts using hash-based tree representative fingerprinting

With due respect to the authors’ rights, plagiarism detection, is one of the critical problems in the field of text-mining that many researchers are interested in. This issue is considered as a serious one in high academic institutions. There exist language-free tools which do not yield any reliable results since the special features of every language are ignored in them. Considering the paucit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004